Picture for Pranjal Aggarwal

Pranjal Aggarwal

Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization

Add code
May 26, 2026
Viaarxiv icon

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

Add code
May 20, 2026
Viaarxiv icon

Gym-Anything: Turn any Software into an Agent Environment

Add code
Apr 07, 2026
Viaarxiv icon

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Add code
Mar 19, 2026
Viaarxiv icon

Propose, Solve, Verify: Self-Play Through Formal Verification

Add code
Dec 20, 2025
Figure 1 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 2 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 3 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 4 for Propose, Solve, Verify: Self-Play Through Formal Verification
Viaarxiv icon

OptimalThinkingBench: Evaluating Over and Underthinking in LLMs

Add code
Aug 18, 2025
Figure 1 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Figure 2 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Figure 3 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Figure 4 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Viaarxiv icon

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Add code
Mar 06, 2025
Viaarxiv icon

Programming with Pixels: Computer-Use Meets Software Engineering

Add code
Feb 24, 2025
Viaarxiv icon

AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

Add code
Dec 09, 2024
Viaarxiv icon

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Add code
Apr 12, 2024
Figure 1 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Figure 2 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Figure 3 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Figure 4 for RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Viaarxiv icon